Transductive Learning via Model Selection; Can Overfitting be Exploited?

نویسندگان

  • Lior Wolf
  • Sayan Mukherjee
چکیده

A novel transductive learning algorithm is proposed, which is based on the use of model selection. In its simplest form there are k possible labels, m labeled points and one unlabeled point. One model is built for each possible classification of the unlabeled point yM+1 = Li, i = 1, ..., k, using all m+1 points and m + 1 labels. Any standard model selection criterion can then be applied to select one of the k models. The algorithm simply chooses the label Li that produced that model. We define the algorithm, show statistical justifications for it, and experimentally show the effectiveness of the algorithm when using a simple linear model combined with a model selection based on cross-validation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Effective Transductive Learning via PAC-Bayesian Model Selection

We study a transductive learning approach based on clustering. In this approach one constructs a diversity of unsupervised models of the unlabeled data using clustering algorithms. These models are then exploited to construct a number of hypotheses using the labeled data and the learner selects an hypothesis that minimizes a transductive PACBayesian error bound, which holds with high probabilit...

متن کامل

Effective transductive learning via objective model selection

This paper is concerned with transductive learning. We study a recent transductive learning approach based on clustering. In this approach one constructs a diversity of unsupervised models of the unlabeled data using clustering algorithms. These models are then exploited to construct a number of hypotheses using the labeled data and the learner selects an hypothesis that minimizes a transductiv...

متن کامل

Analysis and Improved Recognition of Protein Names Using Transductive SVM

We first analyzed protein names using various dictionaries and databases and found five problems with protein names; i.e., the treatment of special characters, the treatment of homonyms, cases where the protein-name string may be a substring of a different protein-name string, cases where one protein exists in different organisms, and the treatment of modifiers. We confirmed that we could use a...

متن کامل

Transductive Learning via Model Selection

A novel transductive learning algorithm is proposed, which is based on the use of model selection. In its simplest form there are k possible labels, m labeled points and one unlabeled point. One model is built for each possible classification of the unlabeled point yM+1 = Li, i = 1, ..., k, using all M + 1 points and M + 1 labels. Any standard model selection criterion can then be applied to se...

متن کامل

Transductive Classification via Local Learning Regularization

The idea of local learning, classifying a particular point based on its neighbors, has been successfully applied to supervised learning problems. In this paper, we adapt it for Transductive Classification (TC) problems. Specifically, we formulate a Local Learning Regularizer (LL-Reg) which leads to a solution with the property that the label of each data point can be well predicted based on its...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005